AITopics | Dartmouth

Collaborating Authors

Dartmouth

41bd71e7bf7f9fe68f1c936940fd06bd-Paper-Conference.pdf

Neural Information Processing Systems | Feb-10-2026, 23:55:55 GMT

inference, neural network, non-linear layer, (17 more...)

Neural Information Processing Systems

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Washington > King County > Redmond (0.04)
North America > United States > North Carolina (0.04)
(4 more...)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference

Neural Information Processing Systems | Oct-8-2025, 13:30:27 GMT

The deployment of Graph Convolutional Networks (GCNs) in the cloud raises privacy concerns due to potential adversarial attacks on client data. To address these security concerns, Privacy-Preserving Machine Learning (PPML) using Homomorphic Encryption (HE) secures sensitive client data.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Washington > King County > Redmond (0.04)
North America > United States > North Carolina (0.04)
(4 more...)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science > Data Mining (0.87)

S$^2$GPT-PINNs: Sparse and Small models for PDEs

Ji, Yajie, Chen, Yanlai, Koohy, Shawn

arXiv.org Machine Learning | Jun-23-2025

We propose S$^2$GPT-PINN, a sparse and small model for solving parametric partial differential equations (PDEs). Similar to Small Language Models (SLMs), S$^2$GPT-PINN is tailored to domain-specific (families of) PDEs and characterized by its compact architecture and minimal computational power. Leveraging a small amount of extremely high quality data via a mathematically rigorous greedy algorithm that is enabled by the large full-order models, S$^2$GPT-PINN relies on orders of magnitude fewer parameters than PINNs to achieve extremely high efficiency via two levels of customizations. The first is knowledge distillation via task-specific activation functions that are transferred from Pre-Trained PINNs. The second is a judicious down-sampling when calculating the physics-informed loss of the network, compressing the number of data sites by orders of magnitude to the size of the small model.

artificial intelligence, gpt-pinn, machine learning, (19 more...)

arXiv.org Machine Learning

2506.15687

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)
Asia > China > Shanghai > Shanghai (0.04)
Europe > Portugal > Braga > Braga (0.04)

Genre: Research Report (0.40)

Industry:

Education (0.46)
Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Adaptive Pruning of Deep Neural Networks for Resource-Aware Embedded Intrusion Detection on the Edge

Broggi, Alexandre, Bastian, Nathaniel, Fiondella, Lance, Kul, Gokhan

arXiv.org Artificial Intelligence | May-21-2025

Artificial neural network pruning is a method in which artificial neural network sizes can be reduced while attempting to preserve the predictive capabilities of the network. This is done to make the model smaller or faster at inference time. In this work, we analyze the ability of a selection of artificial neural network pruning methods to generalize to a new cybersecurity dataset using a simpler network type than they were designed for. We analyze each method at a variety of pruning degrees to best understand how each algorithm responds to the new environment. This has allowed us to determine the best-suited pruning method, among those we examined, for the task. Unexpectedly, we have found that many of them do not generalize well to the problem, leaving only a few algorithms working to an acceptable degree.
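
As a rough illustration of what "pruning degree" means, the sketch below zeroes out a chosen fraction of the smallest-magnitude weights. The abstract does not name the methods that were evaluated, so magnitude pruning is used here only as a familiar baseline; the function name `magnitude_prune` and the sample weights are hypothetical.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude.

    Illustrative baseline only; not one of the specific methods from the paper.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by ascending absolute value; the first n_prune are removed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    return [0.0 if i in pruned else w for i, w in enumerate(weights)]


# Made-up weights, pruned at two different degrees.
weights = [0.8, -0.05, 0.3, -0.9, 0.01, 0.4]
for degree in (0.0, 0.5):
    print(degree, magnitude_prune(weights, degree))
```

Sweeping `degree` over a range of values and measuring accuracy at each step is one simple way to compare how gracefully different pruning criteria degrade.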

artificial intelligence, machine learning, pruning, (16 more...)

arXiv.org Artificial Intelligence

2505.14592

Country: North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Replay to Remember: Retaining Domain Knowledge in Streaming Language Models

Pillai, Sneh

arXiv.org Artificial Intelligence | Apr-25-2025

Traditional fine-tuning methods, while effective, often require substantial computational resources and large, static datasets, making them impractical for real-time applications. Moreover, these models notoriously suffer from catastrophic forgetting, a rapid performance degradation on previously learned tasks when presented with new data (Luo et al., 2023). Recent literature addresses catastrophic forgetting via techniques such as replay buffers, which periodically reintroduce previously learned data, and Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning approach designed to reduce computational overhead (Smith & Jones, 2024; Hu et al., 2021). Although these methods individually show promise, there remains a notable gap in understanding their efficacy and interaction within real-time, streaming learning environments. In this work, we bridge this gap by integrating LoRA with a lightweight replay mechanism under stringent streaming constraints, simulating real-world conditions where models must continually adapt using limited computational resources and data batches. We focus specifically on three distinct domains (medical, genetic, and legal) to evaluate the generalizability and robustness of our approach.
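
The replay idea described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the paper's implementation: the buffer uses reservoir sampling (one common choice for a fixed-memory stream), the names `ReplayBuffer` and `mixed_batches` are hypothetical, and the LoRA update itself is left out because it depends on the model and training library.

```python
import random


class ReplayBuffer:
    """Fixed-size buffer holding a uniform sample of everything seen so far."""

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, example):
        # Reservoir sampling: each example survives with probability capacity/seen.
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(example)
        else:
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = example

    def sample(self, k):
        return self.rng.sample(self.items, min(k, len(self.items)))


def mixed_batches(stream, buffer, replay_k):
    """Yield each incoming batch extended with up to replay_k replayed examples."""
    for batch in stream:
        replayed = buffer.sample(replay_k)
        yield list(batch) + replayed
        for example in batch:
            buffer.add(example)


# Toy stream that switches domain every batch (placeholder strings, not real data).
stream = [["med1", "med2"], ["gen1", "gen2"], ["law1", "law2"]]
buffer = ReplayBuffer(capacity=3)
batches = list(mixed_batches(stream, buffer, replay_k=1))
for b in batches:
    print(b)
```

Each yielded batch would be passed to a parameter-efficient update step; the first batch contains no replayed examples because the buffer starts empty.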

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2504.1778

Country: North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)

Genre: Research Report > New Finding (0.94)

Industry: Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Multi-Modality Sensing in mmWave Beamforming for Connected Vehicles Using Deep Learning

Mollah, Muhammad Baqer, Wang, Honggang, Karim, Mohammad Ataul, Fang, Hua

arXiv.org Artificial Intelligence | Apr-9-2025

Beamforming techniques are considered essential for compensating severe path losses in millimeter-wave (mmWave) communications. In particular, these techniques adopt large antenna arrays and form narrow beams to obtain satisfactory received powers. However, performing accurate beam alignment over narrow beams for efficient link configuration with traditional standard-defined beam selection approaches, which mainly rely on channel state information and beam sweeping through exhaustive searching, imposes computational and communications overheads. These overheads limit their potential use in vehicle-to-infrastructure (V2I) and vehicle-to-vehicle (V2V) communications involving highly dynamic scenarios. In comparison, utilizing out-of-band contextual information, such as sensing data obtained from sensor devices, provides a better alternative to reduce overheads. This paper presents a deep learning-based solution that uses multi-modality sensing data to predict the optimal beams having sufficient mmWave received powers so that the best V2I and V2V line-of-sight links can be ensured proactively. The proposed solution has been tested on real-world measured mmWave sensing and communication data, and the results show that it can achieve up to 98.19% accuracy when predicting the top-13 beams. Correspondingly, when compared to the existing beam sweeping approach, the beam sweeping search space and time overheads are reduced by roughly 79.67% and 91.89%, respectively, which confirms a promising solution for beamforming in mmWave-enabled communications.
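
The "top-13" figure is a top-k accuracy: a prediction counts as correct when the true best beam is among the k highest-scoring candidates, so only those k beams need to be swept. The sketch below shows the metric on made-up scores; `top_k_accuracy` and the toy data are hypothetical and are not taken from the paper.

```python
def top_k_accuracy(scores, true_beams, k):
    """Fraction of samples whose true beam index is among the k top-scored beams."""
    hits = 0
    for row, truth in zip(scores, true_beams):
        # Beam indices ordered from highest to lowest predicted score.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        hits += truth in ranked[:k]
    return hits / len(true_beams)


# Three samples over a toy codebook of three beams (made-up numbers).
scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
]
true_beams = [1, 1, 0]
for k in (1, 2, 3):
    print(k, top_k_accuracy(scores, true_beams, k))
```

Raising k trades a larger sweep for a higher chance of including the best beam, which is why accuracy and search-space reduction are reported together.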

artificial intelligence, communication, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TCCN.2025.3558026

2504.06173

Country: North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Censoring-Aware Tree-Based Reinforcement Learning for Estimating Dynamic Treatment Regimes with Censored Outcomes

Paul, Animesh Kumar, Greiner, Russell

arXiv.org Artificial Intelligence | Mar-9-2025

Dynamic Treatment Regimes (DTRs) provide a systematic approach for making sequential treatment decisions that adapt to individual patient characteristics, particularly in clinical contexts where survival outcomes are of interest. Censoring-Aware Tree-Based Reinforcement Learning (CA-TRL) is a novel framework to address the complexities associated with censored data when estimating optimal DTRs. We explore ways to learn effective DTRs from observational data. By enhancing traditional tree-based reinforcement learning methods with augmented inverse probability weighting (AIPW) and censoring-aware modifications, CA-TRL delivers robust and interpretable treatment strategies. We demonstrate its effectiveness through extensive simulations and real-world applications using the SANAD epilepsy dataset, where it outperformed the recently proposed ASCL method in key metrics such as restricted mean survival time (RMST) and decision-making accuracy. This work represents a step forward in advancing personalized and data-driven treatment strategies across diverse healthcare settings.

dynamic treatment regime, survival time, treatment assignment, (10 more...)

arXiv.org Artificial Intelligence

2503.0669

Country:

North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Graph-Augmented LSTM for Forecasting Sparse Anomalies in Graph-Structured Time Series

Pillai, Sneh

arXiv.org Artificial Intelligence | Mar-5-2025

Anomaly detection in time series data is a well-studied problem due to its importance in detecting faults, intrusions, and unusual events in critical systems [1, 3]. Extensive surveys have reviewed methods for general anomaly detection [1], outlier analysis [3], and specifically for temporal data [4]. Despite this progress, accurately identifying anomalies in time series remains challenging [14]. A key difficulty is that anomalies are often sparse--comprising only a tiny fraction of observations [2]. This extreme class imbalance makes it hard for models to recognize anomalies without producing many false alarms [6]. One strategy to detect anomalies is to forecast future behavior and flag deviations between predictions and actual values [15, 16]. Classical forecasting models, such as ARIMA [12] and exponential smoothing, as well as decomposition-based methods like Prophet [13], have been applied to model normal time series patterns and identify outliers when residuals exceed a threshold. Numerous other approaches leverage deep generative models (e.g., variational autoencoders [17], GANs [18]) or attention mechanisms [19] to improve multivariate time series anomaly detection. However, most prior methods treat multivariate time series as an unstructured collection of variables, not accounting for known relationships among them.
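
The forecast-and-flag strategy mentioned above can be sketched as follows. A trailing moving average stands in for the forecaster purely for brevity (the abstract mentions ARIMA, exponential smoothing, and Prophet in this role); the function name `residual_anomalies`, the window size, and the threshold are illustrative assumptions.

```python
def residual_anomalies(series, window, threshold):
    """Flag time steps whose value deviates from a trailing-mean forecast.

    The forecast for step t is the mean of the previous `window` values;
    a step is flagged when the absolute residual exceeds `threshold`.
    """
    flagged = []
    for t in range(window, len(series)):
        forecast = sum(series[t - window:t]) / window
        if abs(series[t] - forecast) > threshold:
            flagged.append(t)
    return flagged


# Toy series with a single spike at index 5 (made-up values).
series = [10, 10, 11, 10, 10, 25, 10, 11, 10]
print(residual_anomalies(series, window=3, threshold=6))
```

Note that the spike also inflates the forecasts for the following steps, so a slightly lower threshold would raise false alarms after the real anomaly, which illustrates the class-imbalance difficulty the abstract describes.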

anomaly, anomaly detection, graph-augmented lstm, (14 more...)

arXiv.org Artificial Intelligence

2503.03729

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.26)
North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.04)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

GO: The Great Outdoors Multimodal Dataset

Jiang, Peng, Viswanath, Kasi, Nagariya, Akhil, Chustz, George, Wigness, Maggie, Osteen, Philip, Overbye, Timothy, Ellis, Christian, Quang, Long, Saripalli, Srikanth

arXiv.org Artificial Intelligence | Jan-31-2025

The Great Outdoors (GO) dataset is a multi-modal annotated data resource aimed at advancing ground robotics research in unstructured environments. This dataset provides the most comprehensive set of data modalities and annotations compared to existing off-road datasets. In total, the GO dataset includes six unique sensor types with high-quality semantic annotations and GPS traces to support tasks such as semantic segmentation, object detection, and SLAM. The diverse environmental conditions represented in the dataset present significant real-world challenges that provide opportunities to develop more robust solutions to support the continued advancement of field robotics, autonomous exploration, and perception systems in natural environments. The dataset can be downloaded at: https://www.unmannedlab.org/the-great-outdoors-dataset/

artificial intelligence, machine learning, university, (15 more...)

arXiv.org Artificial Intelligence

2501.19274

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Massachusetts > Bristol County > Dartmouth (0.04)
North America > United States > Maryland > Prince George's County > Adelphi (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Automobiles & Trucks (0.94)
Government (0.71)
Transportation > Ground > Road (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.47)

DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning

Dooney, Tom, Narola, Harsh, Bromuri, Stefano, Curier, R. Lyana, Broeck, Chris Van Den, Caudill, Sarah, Tan, Daniel Stanley

arXiv.org Artificial Intelligence | Jan-30-2025

Gravitational wave (GW) interferometers detect faint signals from distant astrophysical events, such as binary black hole mergers. However, their high sensitivity also makes them susceptible to background noise, which can obscure these signals. This noise often includes transient artifacts called "glitches" that can mimic astrophysical signals or mask their characteristics. Fast and accurate reconstruction of both signals and glitches is crucial for reliable scientific inference. In this study, we present DeepExtractor, a deep learning framework designed to reconstruct signals and glitches with power exceeding interferometer noise, regardless of their source. We design DeepExtractor to model the inherent noise distribution of GW interferometers, following conventional assumptions that the noise is Gaussian and stationary over short time scales. It operates by predicting and subtracting the noise component of the data, retaining only the clean reconstruction. Our approach achieves superior generalization capabilities for arbitrary signals and glitches compared to methods that directly map inputs to the clean training waveforms. We validate DeepExtractor's effectiveness through three experiments: (1) reconstructing simulated glitches injected into simulated detector noise, (2) comparing performance with the state-of-the-art BayesWave algorithm, and (3) analyzing real data from the Gravity Spy dataset to demonstrate effective glitch subtraction from LIGO strain data. DeepExtractor achieves a median mismatch of only 0.9% for simulated glitches, outperforming several deep learning baselines. Additionally, DeepExtractor surpasses BayesWave in glitch recovery, offering a dramatic computational speedup by reconstructing one glitch sample in approx. 0.1 seconds on a CPU, compared to BayesWave's processing time of approx. one hour per glitch.
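
The "predict the noise, then subtract it" step and the mismatch score can be illustrated with a toy example. This is a simplified sketch, not the paper's pipeline: the noise estimate is supplied by hand instead of by a network, and `mismatch` uses a plain time-domain inner product, which assumes white noise and skips the noise weighting and any time or phase maximization used in practice. All names and numbers are hypothetical.

```python
import math


def subtract_noise(data, predicted_noise):
    """Reconstruct the clean transient as data minus the predicted noise."""
    return [d - n for d, n in zip(data, predicted_noise)]


def mismatch(a, b):
    """One minus the normalized overlap of two equal-length waveforms.

    Simplification: plain dot product, i.e. white noise and no maximization.
    """
    inner = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return 1.0 - inner / norm


# Toy transient plus toy noise (made-up numbers).
signal = [0.0, 1.0, 0.0, -1.0]
noise = [0.1, -0.2, 0.05, 0.0]
data = [s + n for s, n in zip(signal, noise)]

# With a perfect noise estimate the reconstruction matches the signal.
reconstruction = subtract_noise(data, noise)
print(mismatch(signal, reconstruction))
```

An imperfect noise estimate leaves residual noise in the reconstruction and pushes the mismatch above zero, which is the quantity the abstract reports as a median of 0.9% for simulated glitches.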

artificial intelligence, glitch, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2501.18423

Country:

North America > United States > Massachusetts > Bristol County > Dartmouth (0.14)
Asia > Japan (0.04)
Europe > United Kingdom (0.04)
(12 more...)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine > Diagnostic Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
